Generation of fundamental frequency contours for Mandarin speech synthesis based on tone nucleus model

Authors

  • Qinghua Sun
  • Keikichi Hirose
  • Wentao Gu
  • Nobuaki Minematsu
Abstract

A new method for generating sentence F0 contours of Mandarin speech is proposed. The method assumes the generation process model of F0 contours, but generates the tone and phrase components in different ways and sums them to produce a sentence F0 contour. The tone component is generated by concatenating F0 patterns of tone nuclei, which are predicted by a corpus-based scheme (binary decision trees). F0 contour generation experiments were conducted using 100 news utterances by a female speaker. The results showed that the method could generate F0 contours close to those of the target speech. A perceptual evaluation was also conducted on synthetic speech using the F0 contours generated by the method. An average score of 4.5 on a 5-point scale indicates high naturalness, verifying the validity of the method.
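
The superposition described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The phrase component uses the standard phrase-command impulse response of the generation process model, Gp(t) = α²·t·exp(−αt) for t ≥ 0. Everything else is an assumption made for illustration: the value of α, the representation of each tone nucleus as a straight-line pattern, the linear interpolation between nuclei, and all numeric values in the demo. In the paper, the nucleus patterns are predicted by binary decision trees; here they are simply given as input.

```python
import math

ALPHA = 3.0  # assumed natural angular frequency of the phrase control mechanism (1/s)


def phrase_response(t, alpha=ALPHA):
    """Impulse response of the phrase control mechanism: alpha^2 * t * exp(-alpha * t)."""
    return alpha * alpha * t * math.exp(-alpha * t) if t >= 0.0 else 0.0


def phrase_component(t, commands, alpha=ALPHA):
    """Sum the responses to phrase commands given as (onset_time, magnitude) pairs."""
    return sum(mag * phrase_response(t - onset, alpha) for onset, mag in commands)


def tone_component(t, nuclei):
    """Tone component at time t from concatenated tone-nucleus patterns.

    nuclei: time-sorted list of (start, end, start_value, end_value) in the
    log-F0 domain. Each nucleus is a straight line, and gaps between nuclei
    are bridged by linear interpolation (both are illustrative assumptions).
    """
    points = [(time, value)
              for start, end, v0, v1 in nuclei
              for time, value in ((start, v0), (end, v1))]
    if not points:
        return 0.0
    if t <= points[0][0]:
        return points[0][1]  # hold the first value before the first nucleus
    for (t0, y0), (t1, y1) in zip(points, points[1:]):
        if t <= t1:
            if t1 == t0:
                return y1
            return y0 + (y1 - y0) * (t - t0) / (t1 - t0)
    return points[-1][1]  # hold the last value after the last nucleus


def f0_contour(times, base_hz, phrase_commands, nuclei):
    """Sum baseline, phrase, and tone components in the log domain, then return Hz."""
    return [math.exp(math.log(base_hz)
                     + phrase_component(t, phrase_commands)
                     + tone_component(t, nuclei))
            for t in times]


if __name__ == "__main__":
    # Hypothetical inputs: one phrase command and two tone nuclei.
    demo_commands = [(0.0, 0.4)]
    demo_nuclei = [(0.10, 0.25, 0.30, 0.05), (0.40, 0.55, 0.00, 0.35)]
    demo_times = [i * 0.05 for i in range(14)]
    for t, hz in zip(demo_times, f0_contour(demo_times, 180.0, demo_commands, demo_nuclei)):
        print(f"{t:.2f} s  {hz:.1f} Hz")
```

Summing in the logarithmic domain follows the related abstracts listed further down this page, which describe the logarithmic F0 contour as a superposition of tone components on phrase components.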

Similar resources

Generation of fundamental frequency contours for Thai speech synthesis using tone nucleus model

As a classic and intrinsic requirement, synthetic speech needs to convey correct information to listeners with good naturalness. Fundamental frequency (F0) contours need to be controlled to meet this requirement. Tonal languages pose additional challenges because the F0 contour affects both the intelligibility and the naturalness of the speech. According to the fact that ...

Two-step generation of Mandarin F0 contours based on tone nucleus and superpositional models

A 2-step scheme was developed in our method for synthesizing sentence fundamental frequency (F0) contours of Mandarin speech. The method is based on representing a sentence logarithmic F0 contour as a superposition of tone components on phrase components as in the case of generation process model (F0 model). The tone components are realized by concatenating tone nucleus F0 patterns generated by...

Rule-based Generation of Phrase Components in Two-step Synthesis of Fundamental Frequency Contours of Mandarin

In this paper, a rule-based method was developed for realizing phrase components in our two-step generation of fundamental frequency (F0) contours of Mandarin. The scheme assumes (logarithmic) F0 contours as superposition of tone components on phrase components, which are further assumed to be responses of phrase commands. In general, possibility of a new phrase command comes higher at deeper s...

Improved Prediction of Tone Components for F0 Contour Generation of Mandarin Speech Based on the Tone Nucleus Model

Improved prediction of tone components was realized in our method for synthesizing sentence fundamental frequency (F0) contours of Mandarin speech. The method is based on representing a sentence logarithmic F0 contour as a superposition of tone components on phrase components as in the case of generation process model (F0 model). The tone components are realized by concatenating their fragments...

Generation of F0 Contours for Mandarin Speech in Combination with Rule-based and Corpus-based Methods

A method was developed for synthesizing sentence fundamental frequency (F0) contours of Mandarin speech. It is based on representing an F0 contour in logarithmic frequency scale as a superposition of tone components on phrase components as in the case of generation process model (F0 model). The tone components are realized by concatenating their fragments at tone nuclei predicted by a corpus-ba...

Publication date: 2005